(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3559 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 




(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2..16 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 211..795 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 800..1309 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1324..2091 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2122..2718 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2721..3515 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

A CTC TTT AGC ATT TTC TAGGATTTTT TAGGAGCAAC GCTCTTAGAT CCTTAGTTTT 56 
Leu Phe Ser Ile Phe 
1 5 

TAGCTCTCTG ATTTTTTGTT TATCAAAAAA TTGGGGGCTT TTTTTGTTTT TATTTTTTGT 116 

CAATTTACTA TTTTTCTTTA TGATTAGCTC AAGCAACAAA AGTTATTCGT AAGGTGCGTT 176 

TGTTGTAAAA ATTTTTGTTT GGAAGGAAAA GGCA ATG CTA GGA CTT GTA TTG 228 

Met Leu Gly Leu Val Leu 
1 5 

TTA TAT GTT GGG ATT GTT TTA ATC AGC AAT GGG ATT TGC GGG TTA ACC 276 
Leu Tyr Val Gly Ile Val Leu Ile Ser Asn Gly Ile Cys Gly Leu Thr 
10 15 20 

AAA GTC GAT CCT AAA AGC ACT GCG GTG ATG AAC TTT TTT GTG GGT GGG 324 
Lys Val Asp Pro Lys Ser Thr Ala Val Met Asn Phe Phe Val Gly Gly 
25 30 35 

CTC TCC ATT ATT TGT AAT GTG GTT GTC ATC ACT TAT TCC GCT CTC AAC 372 
Leu Ser Ile Ile Cys Asn Val Val Val Ile Thr Tyr Ser Ala Leu Asn 
40 45 50 

CCT ACA GCC CCT GTA GAA GGT GCT GAA GAT ATT GCT CAA GTA TCA CAC 420 
Pro Thr Ala Pro Val Glu Gly Ala Glu Asp Ile Ala Gln Val Ser His 
55 60 65 70 

CAT TTG ACT AAT TTC TAT GGG CCA GCG ACT GGG TTA TTG TTT GGT TTC 468 
His Leu Thr Asn Phe Tyr Gly Pro Ala Thr Gly Leu Leu Phe Gly Phe 
75 80 85 

ACC TAC TTG TAT GCG GCT ATC AAC CAC ACT TTT GGT TTG GAT TGG AGG 516 
Thr Tyr Leu Tyr Ala Ala Ile Asn His Thr Phe Gly Leu Asp Trp Arg 
90 95 100 

CCC TAC TCT TGG TAT AGC TTA TTC GTA GCG ATC AAC ACG ATT CCT GCT 564 
Pro Tyr Ser Trp Tyr Ser Leu Phe Val Ala Ile Asn Thr Ile Pro Ala 
105 110 115 

GCG ATT TTA TCC CAC TAT AGC GAT ATG CTT GAT GAC CAC AAA GTG TTA 612 
Ala Ile Leu Ser His Tyr Ser Asp Met Leu Asp Asp His Lys Val Leu 
120 125 130 

GGC ATC ACT GAA GGC GAT TGG TGG GCG ATC ATT TGG TTG GCT TGG GGT 660 
Gly Ile Thr Glu Gly Asp Trp Trp Ala Ile Ile Trp Leu Ala Trp Gly 
135 140 145 150 

GTT TTG TGG CTT ACC GCT TTC ATT GAA AAC ATC TTG AAA ATC CCT TTA 708 
Val Leu Trp Leu Thr Ala Phe Ile Glu Asn Ile Leu Lys Ile Pro Leu 
155 160 165 

GGG AAA TTC ACT CCA TGG CTT GCT ATC ATT GAG GGC ATT TTA ACC GCT 756 
Gly Lys Phe Thr Pro Trp Leu Ala Ile Ile Glu Gly Ile Leu Thr Ala 
170 175 180 

TGG ATC CCT GCT TGG TTA CTC TTT ATC CAA CAC TGG GTG TGAG ATG ATC 805 
Trp Ile Pro Ala Trp Leu Leu Phe Ile Gln His Trp Val Met Ile 
185 190 195 1 

ATA GAG CGT TTA ATA GGC AAT CTA AGG GAT TTA AAC CCC TTG GAT TTC 853 
Ile Glu Arg Leu Ile Gly Asn Leu Arg Asp Leu Asn Pro Leu Asp Phe 
5 10 15 

AGC GTG GAT TAT GTG GAT TTG GAA TGG TTT GAA ACG AGG AAA AAA ATC 901 
Ser Val Asp Tyr Val Asp Leu Glu Trp Phe Glu Thr Arg Lys Lys Ile 
20 25 30 

GCT CGC TTT AAA ACC AGG CAA GGC AAA GAC ATA GCC GTA CGC CTT AAA 949 
Ala Arg Phe Lys Thr Arg Gln Gly Lys Asp Ile Ala Val Arg Leu Lys 
35 40 45 50 

GAC GCT CCC AAG TTG GGT TTC TCT CAA GGA GAT ATT TTA TTT AAA GAA 997 
Asp Ala Pro Lys Leu Gly Phe Ser Gln Gly Asp Ile Leu Phe Lys Glu 
55 60 65 



AAG GAA ATT ATC GCC GTT AAT ATC TTG GAT TCT GAA GTC ATT CAC 1045 
Lys Glu Ile Ile Ala Val Asn Ile Leu Asp Ser Glu Val Ile His 
70 75 80 

CAA GCT AAG AGC GTG GCA GAA GTA GCG AAA ATA TGC TAT GAA ATA 1093 
Gln Ala Lys Ser Val Ala Glu Val Ala Lys Ile Cys Tyr Glu Ile 
85 90 95 

AAC CGC CAT GCG GCT TTA TAC TAT GGC GAG TCT CAA TTT GAA TTT 1141 
Asn Arg His Ala Ala Leu Tyr Tyr Gly Glu Ser Gln Phe Glu Phe 
100 105 110 


AAA ACA CCA TTT GAA AAG CCC ACG CTA GCG TTA CTA GAA AAG CTA GGG 1189 
Lys Thr Pro Phe Glu Lys Pro Thr Leu Ala Leu Leu Glu Lys Leu Gly 
115 120 125 130 

GTT CAA AAT CGT GTT TTA AGT TCA AAA TTG GAT TCC AAA GAA CGC TTA 1237 
Val Gln Asn Arg Val Leu Ser Ser Lys Leu Asp Ser Lys Glu Arg Leu 
135 140 145 

ACC GTG AGC ATG CCC CAT AGT GAG CCT AAT TTT AAG GTC TCA CTG GCG 1285 
Thr Val Ser Met Pro His Ser Glu Pro Asn Phe Lys Val Ser Leu Ala 
150 155 160 

AGC GAT TTT AAA GTG GTC ATG AAA TAGAAAAACA ACAA ATG GAT AAA GGA 1335 
Ser Asp Phe Lys Val Val Met Lys Met Asp Lys Gly 
165 170 1 

AAA AGC GTG AAA AGC ATT GAA AAA AGC GTG GGT ATG CTC CCA AAA ACT 1383 
Lys Ser Val Lys Ser Ile Glu Lys Ser Val Gly Met Leu Pro Lys Thr 
5 10 15 20 

CCA AAG ACA GAC AGC AAT GCT CAT GTG GAT AAT GAA TTT CTG ATT CTG 1431 
Pro Lys Thr Asp Ser Asn Ala His Val Asp Asn Glu Phe Leu Ile Leu 
25 30 35 

CAA GTC AAT GAT GCG GTG TTC CCC ATT GGA TCT TAC ACG CAT TCT TTT 1479 
Gln Val Asn Asp Ala Val Phe Pro Ile Gly Ser Tyr Thr His Ser Phe 
40 45 50 

GGG CTT TTG GCT AGA AAC TTA CAT CCA GCA AAA AAG GTT ACT AAT AAA 1527 
Gly Leu Leu Ala Arg Asn Leu His Pro Ala Lys Lys Val Thr Asn Lys 
55 60 65 

GAA AGC GCT TTA AAA TAT TTA AAA GCC AAT CTC TCT AGC CAG TTC CTT 1575 
Glu Ser Ala Leu Lys Tyr Leu Lys Ala Asn Leu Ser Ser Gln Phe Leu 
70 75 80 

TAC ACG GAA ATG CTG AGC TTG AAA CTC ACC TAT GAA AGC GCT CTC CAA 1623 
Tyr Thr Glu Met Leu Ser Leu Lys Leu Thr Tyr Glu Ser Ala Leu Gln 
85 90 95 100 

CAA GAT TTA AAA AGG ATC TTA GGG GTT GAA GAA ATC ATT ACG CTA TCC 1671 
Gln Asp Leu Lys Arg Ile Leu Gly Val Glu Glu Ile Ile Thr Leu Ser 
105 110 115 

ACA AGC CCC ATG GAA TTG CGA TTA GCC AAT CAA AAG CTA GGC AAT CGT 1719 
Thr Ser Pro Met Glu Leu Arg Leu Ala Asn Gln Lys Leu Gly Asn Arg 
120 125 130 

TTC ATT AAA ACC TTA CAA GCC ATG AAC GAA TTA GAC ATT GGC GCA TTT 1767 
Phe Ile Lys Thr Leu Gln Ala Met Asn Glu Leu Asp Ile Gly Ala Phe 
135 140 145 

TTT AAC GCT TAC GCT CAA CAA ACC GAA GAC CCC ACC CAT GCC ACT AGC 1815 
Phe Asn Ala Tyr Ala Gln Gln Thr Glu Asp Pro Thr His Ala Thr Ser 
150 155 160 

TAT GGC GTT TTT GCG GCG AGT TTG GGG ATT GAA TTG AAA AAG GCT TTA 1863 
Tyr Gly Val Phe Ala Ala Ser Leu Gly Ile Glu Leu Lys Lys Ala Leu 
165 170 175 180 

AGG CAT TAT CTT TAT GCA CAA ACT TCT AAC ATG GTA ATT AAC TGC GTT 1911 
Arg His Tyr Leu Tyr Ala Gln Thr Ser Asn Met Val Ile Asn Cys Val 
185 190 195 

AAA AGC GTC CCA CTA TCT CAA AAC GAT GGG CAA AAA ATC TTA TTG AGC 1959 
Lys Ser Val Pro Leu Ser Gln Asn Asp Gly Gln Lys Ile Leu Leu Ser 
200 205 210 

TTG CAA AGC CCT TTT AAC CAG CTC ATA GAA AAA ACC CTA GAA CTA GAC 2007 
Leu Gln Ser Pro Phe Asn Gln Leu Ile Glu Lys Thr Leu Glu Leu Asp 
215 220 225 

GAA AGC CAC TTG TGC GCG GCA AGC GTT CAA AAC GAC ATT AAG GCG ATG 2055 
Glu Ser His Leu Cys Ala Ala Ser Val Gln Asn Asp Ile Lys Ala Met 
230 235 240 

CAG CAT GAG AGT TTA TAC TCG CGC CTT TAT ATG TCT TGAATTTTAT 2101 
Gln His Glu Ser Leu Tyr Ser Arg Leu Tyr Met Ser 
245 250 255 

CTCAAATTGA AAGGAATTTT ATG GTA AAA ATT GGA GTT TGT GGT CCT GTA 2151 

Met Val Lys Ile Gly Val Cys Gly Pro Val 
1 5 10 

GGA AGC GGT AAA ACC GCC TTG ATT GAA GCT TTA ACG CGC CAC ATG TCA 2199 
Gly Ser Gly Lys Thr Ala Leu Ile Glu Ala Leu Thr Arg His Met Ser 
15 20 25 

AAA GAT TAT GAC ATG GCG GTC ATC ACT AAT GAT ATT TAC ACG AAA GAA 2247 
Lys Asp Tyr Asp Met Ala Val Ile Thr Asn Asp Ile Tyr Thr Lys Glu 
30 35 40 

GAC GCA GAA TTT ATG TGT AAA AAT TCG GTG ATG CCA CGA GAG AGG ATC 2295 
Asp Ala Glu Phe Met Cys Lys Asn Ser Val Met Pro Arg Glu Arg Ile 
45 50 55 



ATT GGC GTA GAA ACA GGA GGC TGT CCG CAC ACG GCT ATT AGA GAA GAC 2343 
Ile Gly Val Glu Thr Gly Gly Cys Pro His Thr Ala Ile Arg Glu Asp 
60 65 70 

GCT TCT ATG AAT TTA GAA GCC GTA GAA GAA ATG CAT GGC CGT TTC CCT 2391 
Ala Ser Met Asn Leu Glu Ala Val Glu Glu Met His Gly Arg Phe Pro 
75 80 85 90 

AAT TTG GAA TTG CTT TTG ATT GAA AGC GGA GGC AGT AAC CTT TCA GCG 2439 
Asn Leu Glu Leu Leu Leu Ile Glu Ser Gly Gly Ser Asn Leu Ser Ala 

95 100 105 


ACT TTC AAC CCA GAG CTA GCG GAC TTT ACG ATC TTT GTG ATT GAT GTG 2487 
Thr Phe Asn Pro Glu Leu Ala Asp Phe Thr Ile Phe Val Ile Asp Val 
110 115 120 

GCT GAG GGC GAT AAA ATC CCC AGA AAA GGC GGG CCA GGA ATC ACG CGT 2535 
Ala Glu Gly Asp Lys Ile Pro Arg Lys Gly Gly Pro Gly Ile Thr Arg 
125 130 135 

TCA GAC TTG CTT GTC ATC AAT AAG ATT GAT TTA GCC CCC TAT GTG GGA 2583 
Ser Asp Leu Leu Val Ile Asn Lys Ile Asp Leu Ala Pro Tyr Val Gly 
140 145 150 

GCC GAC TTG AAA GTC ATG GAA AGG GAT TCT AAA AAA ATC GCG GCG AAA 2631 
Ala Asp Leu Lys Val Met Glu Arg Asp Ser Lys Lys Ile Ala Ala Lys 
155 160 165 170 

AGC CCT TTA TTT TTA CCG AAT ATC CGC GCT AAA GAA GGT TTA GAC GAT 2679 
Ser Pro Leu Phe Leu Pro Asn Ile Arg Ala Lys Glu Gly Leu Asp Asp 
175 180 185 

GTG ATC GCT TGG ATC AAG CGC AAC GCT TTA TTG GAA GAT TG ATG AAC 2726 
Val Ile Ala Trp Ile Lys Arg Asn Ala Leu Leu Glu Asp Met Asn 
190 195 1 

ACT TAC GCT CAA GAA TCC AAG CTC AGG TTA AAA ACC AAA ATA GGG GCT 2774 
Thr Tyr Ala Gln Glu Ser Lys Leu Arg Leu Lys Thr Lys Ile Gly Ala 
5 10 15 

GAC GGG CGG TGC GTG ATT GAA GAC AAT TTT TTC ACG CCC CCC TTT AAG 2822 
Asp Gly Arg Cys Val Ile Glu Asp Asn Phe Phe Thr Pro Pro Phe Lys 
20 25 30 

CTC ATG GCG CCC TTT TAC CCT AAA GAC GAT TTA GCG GAA ATC ATG CTT 2870 
Leu Met Ala Pro Phe Tyr Pro Lys Asp Asp Leu Ala Glu Ile Met Leu 
35 40 45 50 

TTA GCG GTA AGC CCT GGC TTA ATG AAA GGC GAT GCA CAA GAT GTG CAA 2918 
Leu Ala Val Ser Pro Gly Leu Met Lys Gly Asp Ala Gln Asp Val Gln 
55 60 65 


TTG AAC ATC GGT CCA AAT TGC AAG TTA AGG ATC ACT TCG CAA TCC TTT 2966 
Leu Asn Ile Gly Pro Asn Cys Lys Leu Arg Ile Thr Ser Gln Ser Phe 
70 75 80 

GAA AAA ATC CAT AAC ACT GAA GAC GGG TTT GCT AGC AGA GAC ATG CAT 3014 
Glu Lys Ile His Asn Thr Glu Asp Gly Phe Ala Ser Arg Asp Met His 
85 90 95 

ATC GTT GTG GGG GAA AAC GCT TTT TTA GAC TTC GCG CCC TTC CCG TTA 3062 
Ile Val Val Gly Glu Asn Ala Phe Leu Asp Phe Ala Pro Phe Pro Leu 
100 105 110 

ATC CCC TTT GAA AAC GCG CAT TTT AAG GGC AAT ACC ACG ATT TCT TTG 3110 
Ile Pro Phe Glu Asn Ala His Phe Lys Gly Asn Thr Thr Ile Ser Leu 
115 120 125 130 

CGC TCT AGC TCC CAA TTG CTC TAT AGT GAA ATC ATT GTC GCA GGG CGA 3158 
Arg Ser Ser Ser Gln Leu Leu Tyr Ser Glu Ile Ile Val Ala Gly Arg 
135 140 145 

GTG GCG CGC AAT GAG TTG TTT AAA TTC AAC CGC TTG CAC ACC AAA ATC 3206 
Val Ala Arg Asn Glu Leu Phe Lys Phe Asn Arg Leu His Thr Lys Ile 
150 155 160 

TCT ATT TTA CAA GAT GAG AAA CCC ATC TAT TAT GAC AAC ACG ATT TTA 3254 
Ser Ile Leu Gln Asp Glu Lys Pro Ile Tyr Tyr Asp Asn Thr Ile Leu 
165 170 175 

GAT CCC AAA ACC ACC GAC TTA AAT AAC ATG TGC ATG TTT GAT GGC TAT 3302 
Asp Pro Lys Thr Thr Asp Leu Asn Asn Met Cys Met Phe Asp Gly Tyr 
180 185 190 

ACG CAT TAT TTG AAT TTG GTG CTG GTC AAT TGC CCC ATA GAG CTG TCT 3350 
Thr His Tyr Leu Asn Leu Val Leu Val Asn Cys Pro Ile Glu Leu Ser 
195 200 205 210 

GGC GTG CGA GGA TTG ATT GAA GAG AGC GAA GGA GTG GAT GGA GCC GTG 3398 
Gly Val Arg Gly Leu Ile Glu Glu Ser Glu Gly Val Asp Gly Ala Val 
215 220 225 

AGT GAA ATC GCT AGT TCT CAT TTA TGC CTG AAA GCT TTA GCG AAA GGC 3446 
Ser Glu Ile Ala Ser Ser His Leu Cys Leu Lys Ala Leu Ala Lys Gly 
230 235 240 

TCA GAA CCC TTG TTG CAT TTA AGA GAA AAA ATC GCT CGC TTT ATC ACG 3494 
Ser Glu Pro Leu Leu His Leu Arg Glu Lys Ile Ala Arg Phe Ile Thr 
245 250 255 

CAA ACG ATT ACG CCA AAG GTT TAAAAAACAC TTTAAAAAAG ATTATACCCT 3545 
Gln Thr Ile Thr Pro Lys Val 
260 265 

TTAGTCTTTT TTAA 3559 
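
The six CDS locations given for SEQ ID NO: 1 can be checked against the lengths stated for the translated sequences. The following is a minimal sketch of that arithmetic, not part of the listing itself. It assumes the locations are 1-based and inclusive, that each span excludes the stop codon, and that the CDS entries correspond, in listing order, to SEQ ID NO: 2 through SEQ ID NO: 7. The names `CDS_LOCATIONS`, `STATED_LENGTHS` and `codon_count` are illustrative only.

```python
# Illustrative check only; not part of the sequence listing itself.
# CDS locations copied from the FEATURE entries of SEQ ID NO: 1
# (assumed 1-based and inclusive).
CDS_LOCATIONS = [
    (2, 16),
    (211, 795),
    (800, 1309),
    (1324, 2091),
    (2122, 2718),
    (2721, 3515),
]

# Lengths stated under SEQ ID NO: 2 through SEQ ID NO: 7. Pairing them with
# the CDS entries in listing order is an assumption made for this sketch.
STATED_LENGTHS = [5, 195, 170, 256, 199, 265]


def codon_count(start: int, end: int) -> int:
    """Return the number of whole codons in a 1-based inclusive span."""
    span = end - start + 1
    if span <= 0 or span % 3 != 0:
        raise ValueError(f"span {start}..{end} is not a whole number of codons")
    return span // 3


codon_counts = [codon_count(start, end) for start, end in CDS_LOCATIONS]

for (start, end), codons, stated in zip(CDS_LOCATIONS, codon_counts, STATED_LENGTHS):
    status = "matches" if codons == stated else "differs from"
    print(f"{start}..{end}: {codons} codons, {status} stated length {stated}")
```

Under these assumptions, each span divides evenly into codons and the codon count equals the stated amino acid length of the corresponding sequence.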


(2) INFORMATION FOR SEQ ID NO: 2: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Leu Phe Ser Ile Phe 
1 5 


(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 195 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Met Leu Gly Leu Val Leu Leu Tyr Val Gly Ile Val Leu Ile Ser Asn 
1 5 10 15 

Gly Ile Cys Gly Leu Thr Lys Val Asp Pro Lys Ser Thr Ala Val Met 
20 25 30 

Asn Phe Phe Val Gly Gly Leu Ser Ile Ile Cys Asn Val Val Val Ile 
35 40 45 

Thr Tyr Ser Ala Leu Asn Pro Thr Ala Pro Val Glu Gly Ala Glu Asp 
50 55 60 

Ile Ala Gln Val Ser His His Leu Thr Asn Phe Tyr Gly Pro Ala Thr 
65 70 75 80 

Gly Leu Leu Phe Gly Phe Thr Tyr Leu Tyr Ala Ala Ile Asn His Thr 
85 90 95 

Phe Gly Leu Asp Trp Arg Pro Tyr Ser Trp Tyr Ser Leu Phe Val Ala 
100 105 110 

Ile Asn Thr Ile Pro Ala Ala Ile Leu Ser His Tyr Ser Asp Met Leu 
115 120 125 

Asp Asp His Lys Val Leu Gly Ile Thr Glu Gly Asp Trp Trp Ala Ile 
130 135 140 

Ile Trp Leu Ala Trp Gly Val Leu Trp Leu Thr Ala Phe Ile Glu Asn 
145 150 155 160 

Ile Leu Lys Ile Pro Leu Gly Lys Phe Thr Pro Trp Leu Ala Ile Ile 
165 170 175 

Glu Gly Ile Leu Thr Ala Trp Ile Pro Ala Trp Leu Leu Phe Ile Gln 
180 185 190 

His Trp Val 
195 


(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 170 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Ile Ile Glu Arg Leu Ile Gly Asn Leu Arg Asp Leu Asn Pro Leu 
1 5 10 15 

Asp Phe Ser Val Asp Tyr Val Asp Leu Glu Trp Phe Glu Thr Arg Lys 
20 25 30 

Lys Ile Ala Arg Phe Lys Thr Arg Gln Gly Lys Asp Ile Ala Val Arg 
35 40 45 

Leu Lys Asp Ala Pro Lys Leu Gly Phe Ser Gln Gly Asp Ile Leu Phe 
50 55 60 

Lys Glu Glu Lys Glu Ile Ile Ala Val Asn Ile Leu Asp Ser Glu Val 
65 70 75 80 

Ile His Ile Gln Ala Lys Ser Val Ala Glu Val Ala Lys Ile Cys Tyr 
85 90 95 

Glu Ile Gly Asn Arg His Ala Ala Leu Tyr Tyr Gly Glu Ser Gln Phe 
100 105 110 

Glu Phe Lys Thr Pro Phe Glu Lys Pro Thr Leu Ala Leu Leu Glu Lys 
115 120 125 

Leu Gly Val Gln Asn Arg Val Leu Ser Ser Lys Leu Asp Ser Lys Glu 
130 135 140 

Arg Leu Thr Val Ser Met Pro His Ser Glu Pro Asn Phe Lys Val Ser 
145 150 155 160 

Leu Ala Ser Asp Phe Lys Val Val Met Lys 
165 170 


(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 256 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Asp Lys Gly Lys Ser Val Lys Ser Ile Glu Lys Ser Val Gly Met 
1 5 10 15 

Leu Pro Lys Thr Pro Lys Thr Asp Ser Asn Ala His Val Asp Asn Glu 
20 25 30 

Phe Leu Ile Leu Gln Val Asn Asp Ala Val Phe Pro Ile Gly Ser Tyr 
35 40 45 

Thr His Ser Phe Gly Leu Leu Ala Arg Asn Leu His Pro Ala Lys Lys 
50 55 60 

Val Thr Asn Lys Glu Ser Ala Leu Lys Tyr Leu Lys Ala Asn Leu Ser 
65 70 75 80 

Ser Gln Phe Leu Tyr Thr Glu Met Leu Ser Leu Lys Leu Thr Tyr Glu 
85 90 95 

Ser Ala Leu Gln Gln Asp Leu Lys Arg Ile Leu Gly Val Glu Glu Ile 
100 105 110 

Ile Thr Leu Ser Thr Ser Pro Met Glu Leu Arg Leu Ala Asn Gln Lys 
115 120 125 

Leu Gly Asn Arg Phe Ile Lys Thr Leu Gln Ala Met Asn Glu Leu Asp 
130 135 140 

Ile Gly Ala Phe Phe Asn Ala Tyr Ala Gln Gln Thr Glu Asp Pro Thr 
145 150 155 160 

His Ala Thr Ser Tyr Gly Val Phe Ala Ala Ser Leu Gly Ile Glu Leu 
165 170 175 

Lys Lys Ala Leu Arg His Tyr Leu Tyr Ala Gln Thr Ser Asn Met Val 
180 185 190 

Ile Asn Cys Val Lys Ser Val Pro Leu Ser Gln Asn Asp Gly Gln Lys 
195 200 205 

Ile Leu Leu Ser Leu Gln Ser Pro Phe Asn Gln Leu Ile Glu Lys Thr 
210 215 220 

Leu Glu Leu Asp Glu Ser His Leu Cys Ala Ala Ser Val Gln Asn Asp 
225 230 235 240 

Ile Lys Ala Met Gln His Glu Ser Leu Tyr Ser Arg Leu Tyr Met Ser 
245 250 255 


(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 199 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Val Lys Ile Gly Val Cys Gly Pro Val Gly Ser Gly Lys Thr Ala 
1 5 10 15 

Leu Ile Glu Ala Leu Thr Arg His Met Ser Lys Asp Tyr Asp Met Ala 
20 25 30 

Val Ile Thr Asn Asp Ile Tyr Thr Lys Glu Asp Ala Glu Phe Met Cys 
35 40 45 

Lys Asn Ser Val Met Pro Arg Glu Arg Ile Ile Gly Val Glu Thr Gly 
50 55 60 

Gly Cys Pro His Thr Ala Ile Arg Glu Asp Ala Ser Met Asn Leu Glu 
65 70 75 80 

Ala Val Glu Glu Met His Gly Arg Phe Pro Asn Leu Glu Leu Leu Leu 
85 90 95 

Ile Glu Ser Gly Gly Ser Asn Leu Ser Ala Thr Phe Asn Pro Glu Leu 
100 105 110 

Ala Asp Phe Thr Ile Phe Val Ile Asp Val Ala Glu Gly Asp Lys Ile 
115 120 125 

Pro Arg Lys Gly Gly Pro Gly Ile Thr Arg Ser Asp Leu Leu Val Ile 
130 135 140 

Asn Lys Ile Asp Leu Ala Pro Tyr Val Gly Ala Asp Leu Lys Val Met 
145 150 155 160 

Glu Arg Asp Ser Lys Lys Ile Ala Ala Lys Ser Pro Leu Phe Leu Pro 
165 170 175 

Asn Ile Arg Ala Lys Glu Gly Leu Asp Asp Val Ile Ala Trp Ile Lys 
180 185 190 

Arg Asn Ala Leu Leu Glu Asp 
195 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 265 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Met Asn Thr Tyr Ala Gln Glu Ser Lys Leu Arg Leu Lys Thr Lys Ile 
1 5 10 15 

Gly Ala Asp Gly Arg Cys Val Ile Glu Asp Asn Phe Phe Thr Pro Pro 
20 25 30 

Phe Lys Leu Met Ala Pro Phe Tyr Pro Lys Asp Asp Leu Ala Glu Ile 
35 40 45 

Met Leu Leu Ala Val Ser Pro Gly Leu Met Lys Gly Asp Ala Gln Asp 
50 55 60 

Val Gln Leu Asn Ile Gly Pro Asn Cys Lys Leu Arg Ile Thr Ser Gln 
65 70 75 80 

Ser Phe Glu Lys Ile His Asn Thr Glu Asp Gly Phe Ala Ser Arg Asp 
85 90 95 

Met His Ile Val Val Gly Glu Asn Ala Phe Leu Asp Phe Ala Pro Phe 
100 105 110 

Pro Leu Ile Pro Phe Glu Asn Ala His Phe Lys Gly Asn Thr Thr Ile 
115 120 125 

Ser Leu Arg Ser Ser Ser Gln Leu Leu Tyr Ser Glu Ile Ile Val Ala 
130 135 140 

Gly Arg Val Ala Arg Asn Glu Leu Phe Lys Phe Asn Arg Leu His Thr 
145 150 155 160 

Lys Ile Ser Ile Leu Gln Asp Glu Lys Pro Ile Tyr Tyr Asp Asn Thr 
165 170 175 

Ile Leu Asp Pro Lys Thr Thr Asp Leu Asn Asn Met Cys Met Phe Asp 
180 185 190 

Gly Tyr Thr His Tyr Leu Asn Leu Val Leu Val Asn Cys Pro Ile Glu 
195 200 205 

Leu Ser Gly Val Arg Gly Leu Ile Glu Glu Ser Glu Gly Val Asp Gly 
210 215 220 

Ala Val Ser Glu Ile Ala Ser Ser His Leu Cys Leu Lys Ala Leu Ala 
225 230 235 240 

Lys Gly Ser Glu Pro Leu Leu His Leu Arg Glu Lys Ile Ala Arg Phe 
245 250 255 

Ile Thr Gln Thr Ile Thr Pro Lys Val 
260 265 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

GCGAAAATAT GCTATGAAAT AGGAAACCGC CAT 33 


(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Ala Lys Ile Cys Tyr Glu Ile Gly Asn Arg His 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: modified_base 

(B) LOCATION: 16 

(D) OTHER INFORMATION: /note= "W is either A or T." 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

TGGYAYRNNN NYYGCW 16 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

TTGACA 6 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



TATAAT 
6 
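
SEQ ID NO: 10 is written with nucleotide ambiguity symbols. The sketch below shows one way such a degenerate sequence could be expanded into a regular expression and how many concrete sequences it covers. It is an illustration, not part of the listing. The listing itself defines only W; the meanings used here for R (A or G), Y (C or T) and N (any base) are the standard IUPAC ones. The function names and the two candidate sequences are hypothetical.

```python
import re

# Ambiguity symbols occurring in SEQ ID NO: 10. The listing defines W;
# R, Y and N are given their standard IUPAC meanings (an assumption here).
IUPAC_CLASSES = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG",    # purine
    "Y": "CT",    # pyrimidine
    "W": "AT",    # "W is either A or T."
    "N": "ACGT",  # any base
}


def normalize(seq: str) -> str:
    """Drop the spacing used in the listing and upper-case the symbols."""
    return "".join(seq.split()).upper()


def iupac_to_regex(seq: str) -> str:
    """Translate a degenerate sequence into a regular-expression string."""
    parts = []
    for symbol in normalize(seq):
        bases = IUPAC_CLASSES[symbol]
        parts.append(bases if len(bases) == 1 else f"[{bases}]")
    return "".join(parts)


def degeneracy(seq: str) -> int:
    """Count the concrete sequences a degenerate sequence stands for."""
    total = 1
    for symbol in normalize(seq):
        total *= len(IUPAC_CLASSES[symbol])
    return total


SEQ_ID_10 = "TGGYAYRNNN NYYGCW"
pattern = re.compile(iupac_to_regex(SEQ_ID_10))

# Hypothetical candidate sequences, invented for illustration only.
print(bool(pattern.fullmatch("TGGCACGAAAATTGCA")))
print(bool(pattern.fullmatch("TGGCACGAAAATTGCG")))
print(degeneracy(SEQ_ID_10))
```

The first candidate satisfies every position of the pattern. The second differs only at position 16, where G is not permitted by W.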


